



Veterinary Microbiology 120 (2007) 167-172 




www.elsevier.com/locate/vetmic 


Short communication 


Genetic distance of SARS coronavirus from 
the recent natural case 


Zhi-Gang Wang, Shu-Ping Xu, Yan-Jun Zhang, Qi-Yu Bao


Zhejiang Provincial Center for Disease Prevention and Control, 17 Laozhedazhi Road, Hangzhou 310009, China
College of Biosystems Engineering and Food Science, University of Zhejiang, Hangzhou 310029, China
Institute of Biomedical Informatics, Wenzhou Medical College, Wenzhou 325000, China


Received 14 June 2005; received in revised form 26 July 2006; accepted 10 October 2006 


Abstract 


Phylogenetic analysis of SARS coronavirus isolates based on the spike gene and protein sequences, using Neighbor-Joining,
maximum likelihood and Bayesian inference methods, indicated that a recent human SARS-CoV isolate was closer to some
human SARS-CoV isolates from the early phase of the previous epidemic than to the SARS-CoV-like viruses isolated from
wild animals during the previous epidemic. Judging from the phylogenetic relationships and sequence variations, it is
likely that the recent human SARS-CoV isolate is closer to an unknown SARS-CoV predecessor.


© 2006 Elsevier B.V. All rights reserved. 
1. Introduction


SARS coronavirus (SARS-CoV) phylogeny and
genotyping studies have progressed since the emergence
of SARS (Ruan et al., 2003; Tsui et al., 2003; Zhao
et al., 2004). Genotypes C and T were first proposed at
the end of May 2003 (Li et al., 2003), and were later
refined and named the Yexin genotype and the
Xiaohong genotype (Wang et al., 2004). SARS virus
was supposed to have been transmitted from wild
animals to humans. This hypothesis was then supported by
the identification of a SARS-CoV-like virus in wild
animals, such as palm civets, sold in markets in south
China: it had more than 99% sequence identity to
SARS-CoV (Guan et al., 2003), indicating that
the virus could have recently switched hosts from
animals to humans. However, recent reports
indicated that SARS-CoV was distinct from the virus in
palm civets, and there is so far no direct evidence to show
whether the palm civet virus was the origin of SARS-CoV
or whether palm civets were themselves infected from other
species (Stadler et al., 2003). Although unlikely, the possibility
that humans infected these SARS-CoV-positive animals
cannot be formally excluded, and there was a report of
SARS-CoV transmitted from human to pig (Chen et al.,
2005). Where is the SARS-CoV-like virus of palm civets
in the chain? Are they getting it from another animal?
Are palm civets infecting rodents as well as humans?
These questions have not yet been answered.

Here, we analyze the evolutionary relationships among
the SARS-CoVs from the previous epidemic, the newly
occurred SARS-CoV from the end of 2003 (WHO, 2004),
and the SARS-CoV-like viruses of animal origin from the
previous epidemic.


2. Materials and methods 


The complete spike glycoprotein gene sequences of
SARS-CoVs and SARS-CoV-like viruses were downloaded
from the NCBI GenBank database; the GenBank accession
numbers are given in Table 1. Multiple nucleic
acid or amino acid sequences were aligned with ClustalW 1.83
(Thompson et al., 1994). Phylogenetic trees were
constructed with MEGA 3.1 (Kumar et al., 2004)
(Neighbor-Joining (NJ) for gene and protein
sequences), PAUP* 4.0b2 (Swofford, 2002) (maximum
likelihood for gene sequences) and MrBayes
3.1.2 (Huelsenbeck and Ronquist, 2001) (Bayesian
inference for gene and protein sequences).
FIPV-X06170 (feline infectious peritonitis virus) was used
as an outgroup within the spike gene data set.
SARS-CoV and FIPV are known to be highly identical

Fig. 1. NJ trees of the newly occurred, animal-origin and previous epidemic SARS-CoVs. A red hollow circle indicates a genotype T isolate and a red
solid circle indicates a genotype C isolate, both from humans in the previous epidemic. A yellow solid triangle indicates an animal-origin virus from the
previous epidemic. The blue solid circle indicates the newly occurred virus. The purple hollow triangle indicates the outgroup. Left tree: constructed using
the p-distance of nucleotide differences; bootstrap = 5000; branch lengths indicate the number of nucleotide differences per site of the spike gene. Right
tree: topological tree constructed using the p-distance of nucleotide differences; bootstrap = 5000. Table below: the genetic distance
between SARS-CoV GD03T0013 and three groups. Here, genotype C comprised 43 isolates, genotype T comprised 76 isolates, and the animal-origin
group comprised SARS-CoV SZ1, SZ3, SZ13 and SZ16.


Table 1
The variant locations and substitution types in the spike protein sequences

Accession No.    Isolate
AY525636         GD03T0013
AY304486         SZ3
AY304487         SZ13
AY304488         SZ16
AY304489         SZ1
AY390556         GZ02
AY394996         ZS-B
AY394981         HGZ8L1-A
AY394988         JMD
AY278487         BJ02
AY278488         BJ01
AY278554         CUHK-W1
AY310120         FRA
AY274119         TOR2
AP006560         TWS
AY283798         Sin2774
AY286320         ZJ01
AY429079         BJ302-8
AY322207         LY

[Variant-location and substitution-type columns: not legible in the source.]




[Fig. 2 image: two phylogenetic trees of the spike gene sequences, labelled Bayes and ML; scale bar 0.001.]


Fig. 2. Phylogenetic trees of the spike protein gene
sequences built with other models. The Bayesian inference (Bayes)
tree was constructed with MrBayes 3.1.2 using the GTR model with
60,000 generations, sampling trees every 10th generation and
calculating a consensus tree with 25% of all trees, with
gamma-distributed rate variation across sites and a proportion of
invariable sites; the probability of each partition is indicated on
the branch. For the maximum likelihood (ML) tree, the program
MODELTEST 3.7 (Posada and Crandall, 1998) was used to find the
best model and


throughout the spike gene sequence (Stavrinides and 
Guttman, 2004). 


3. Results 


Neighbor-Joining (NJ) trees of the spike gene
sequences (Fig. 1) indicated that the SARS-CoV
of the newly occurred case (GD03T0013) is closer to the
human SARS-CoVs detected in the early phase of the
previous epidemic (such as GZ02, GZ-B and BJ02) than
to the palm civet or raccoon dog SARS-CoV-like
viruses (SZ1, SZ3, SZ13 and SZ16) detected in the
previous epidemic. The p-distances between the new isolate
(GD03T0013) and the GZ02 and GZ-B isolates are
smaller than that between the new isolate and the SZ3
isolate (Fig. 1). Similar results for the gene sequences
were obtained with other phylogenetic methods, namely
Bayesian inference and maximum likelihood (Fig. 2).
The phylogenetic trees of the spike protein sequences
also showed similar characteristics
(Fig. 3). Compared with the new isolate, the variant
locations in the spike protein and the substitution types
occurring in these isolates were markedly different
between the human SARS-CoVs and the animal-origin
SARS-CoV-like viruses: the former have 5-7 mutual
variant locations, the latter 8-9 mutual variant
locations (Table 1).
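
The p-distance comparison described above can be illustrated with a minimal sketch. It assumes that sequences are already aligned and that sites with a gap in either sequence are skipped (pairwise deletion); MEGA's exact gap handling may differ. The function names and the short toy sequences are hypothetical and are not data from this study.

```python
from typing import Iterable

# Characters treated as missing data; an assumption of this sketch.
GAP_CHARS = {"-", "?"}


def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap are skipped (pairwise deletion).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in GAP_CHARS or b in GAP_CHARS:
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def mean_distance_to_group(query: str, group: Iterable[str]) -> float:
    """Average p-distance between one sequence and every member of a group."""
    members = list(group)
    if not members:
        raise ValueError("group is empty")
    return sum(p_distance(query, m) for m in members) / len(members)


# Toy aligned sequences (made up for illustration only).
query = "ATGCATGCAT"
human_like = ["ATGCATGCAT", "ATGCATGCAA"]
animal_like = ["ATGAATGCAA", "TTGAATGCAA"]

d_human = mean_distance_to_group(query, human_like)
d_animal = mean_distance_to_group(query, animal_like)
print(f"mean p-distance to human-like group:  {d_human:.3f}")
print(f"mean p-distance to animal-like group: {d_animal:.3f}")
```

In this toy case the query is closer to the first group because its mean p-distance is smaller, which is the same kind of comparison made between GD03T0013 and the isolate groups in Fig. 1.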


4. Discussion 


Our analysis differs markedly from the conclusion
of a previous report in Science (Zhao
et al., 2004), which claimed that phylogenetic analysis of
this S gene sequence with those from the human
SARS-CoV and palm civet SARS-like coronavirus
indicated that this most recent case of SARS-CoV
(GD03T0013) is much closer to the palm civet
SARS-like coronavirus than to any human SARS-CoV
detected in the previous epidemic. Their conclusion
has been cited by later reports (Song et al., 2005; Wu et al.,
2004). In our opinion, the phylogenetic relationship
should be interpreted cautiously.


parameters for PAUP* 4.0b2 to build the maximum likelihood tree;
branch lengths indicate substitutions/site. Confidence in the ML tree
was determined by analyzing 300 bootstrap replicates.

Fig. 3. Phylogenetic trees of the spike protein sequences. The
Neighbor-Joining (NJ) tree was constructed with MEGA 3.1 using the JTT
model; confidence in the NJ tree was determined by analyzing 1000
bootstrap replicates, and branch lengths indicate substitutions/site.
The Bayesian inference (Bayes) tree was constructed with MrBayes 3.1.2
using the JTT model with 50,000 generations, sampling trees every
10th generation and calculating a consensus tree with 25% of all trees;
branch lengths indicate substitutions/site, and the probability of each
partition is indicated on the branch.


In this context, judging from the phylogenetic
relationships and sequence variations, it is
likely that the recent human SARS-CoV isolate is
closer to an unknown SARS-CoV predecessor than to the
SARS-CoVs from humans or the SARS-CoV-like viruses
from palm civets detected in the previous
epidemic.